CDS

Accession Number TCMCG075C19371
gbkey CDS
Protein Id XP_017978845.1
Location join(17830642..17830714,17831097..17831269,17831619..17831656,17831732..17831811,17832283..17832350,17832983..17833301,17834034..17834146,17834231..17834296,17834619..17834709,17834866..17834933,17835391..17835428,17835594..17835780)
Gene LOC18595979
GeneID 18595979
Organism Theobroma cacao

Protein

Length 437aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018123356.1
Definition PREDICTED: uncharacterized protein LOC18595979 isoform X2 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category K
Description isoform X1
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
KEGG_ko ko:K18666        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGGACAAGAAGAAAAGAAAGATTGGCACTCCTGTATGGAAGCCAGTATGTACTCAACCTAGTTCCCTCGAAGAGCATGCCATAAAGGATGTGATGGTTGAGTCTGAAAATGGAAGTGAAATGCAAGAAGTGAATGAAGTTACAAATGCAACTGTTAGTCCTAAGGCTTTGGAGGATGATATCGAAGATGGAGCGTTAAAAGAAGAGCCAGTGCTTTCAGATGAAAAGCACTCACTGTCTGTTGAGATTGGTGCATCTTTAATTCAATTTGTCAGAGGAAAAGAAGGATCTACAAAGGAAAAGATTGAAAAGGAGATGGGAGTTCAGATTATACTTCCATCATCAAAGCAGGAGGATTCTATTATGATTGAAGGCACTTCAGCTGATAGTGTAACTAAAGCTTCAAAAGAGATACAACATATAATCGATGAGGCCGTTAAGACCCCAAGTCTTGACTACTCGCACTTTGTCTCTCTTCCTTTGGCTATACATCCTGAGCTGGTTGACAAGCTTGTCGACTTTCAGAACTCCATATTGGGAATTAGTGATGCCTGTGTAGATGACAATCAGGAAGACAACTCAGATGGAGATACTTCTGGAGATGAAGCTCAGGAGCAGCAGTTAGGTAAAGGACCTGACATGGCAGTTGAAGTTAAAGTTTCTGATGACAAGAAAAGTGTTAAGGTGGATGTAAGCGGCATTCCTCTTGTTAGTTATGTACCTAAAGAATCAAAGTCTTCTAATTTATCAGACTTGGGAATTGAAAAGTCCATATTTATTAAACCTAAAACATTTCACTTGACGGTGCTCATGTTGAAGTTGTGGAACAAAGAAAGAGTTAATTTAGCAGCTGAGGTATTGAAGAGTATCTCCTCAAAAGTGATGGATGCTTTGGATAATCGACCTATATTTGTAAGACTCAAGGGTCTGAATTGCATGAGAGGTTCTTTGGCCAGAGCCCGAGTTGTTTATGCTCCTGTGGAAGAAATTGGCAGTGAAAATCGACTTTTATGTGCCTGTGAAGTTATTATCAATGCTTTTGTTGAGGCTGGGCTTGTTCTAGAGAAAGATGCTAGGCATGAGTTAAAGTTGCATGCCACTGTGATGAATGCAAGGCATAGAAAAAGGAAGGGAAAGAGGGGAAAGTTTGATTCCTTCGACGCACGAGGCATCTTCAAGCAGTTTGGATCTGAGGAATGGGGCGAGTATCTCATCCGTGAAGCTCATCTTTCACAAAGATTCAAGTTTGATGAGAATGGTTATTACCATTGTTGTGCTTCAATACCTTTTCCTGAAAACATGCAAGTTGACTGA
Protein:  
MDKKKRKIGTPVWKPVCTQPSSLEEHAIKDVMVESENGSEMQEVNEVTNATVSPKALEDDIEDGALKEEPVLSDEKHSLSVEIGASLIQFVRGKEGSTKEKIEKEMGVQIILPSSKQEDSIMIEGTSADSVTKASKEIQHIIDEAVKTPSLDYSHFVSLPLAIHPELVDKLVDFQNSILGISDACVDDNQEDNSDGDTSGDEAQEQQLGKGPDMAVEVKVSDDKKSVKVDVSGIPLVSYVPKESKSSNLSDLGIEKSIFIKPKTFHLTVLMLKLWNKERVNLAAEVLKSISSKVMDALDNRPIFVRLKGLNCMRGSLARARVVYAPVEEIGSENRLLCACEVIINAFVEAGLVLEKDARHELKLHATVMNARHRKRKGKRGKFDSFDARGIFKQFGSEEWGEYLIREAHLSQRFKFDENGYYHCCASIPFPENMQVD